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Abstract 

This paper provides analysis to a generalized version of the coupon collector 
problem, in which the collector gets d coupons each run and he chooses the one 
that he has the least so far. In the asymptotic case when the number of coupons 
n goes to infinity, we show that on average - lo £ n + 5 (m — 1) log log n + 0(mn) 
runs are needed to collect m sets of coupons. An efficient exact algorithm is 
also developed for any finite case to compute the average needed runs exactly. 

> 

qq ■ Numerical examples are provided to verify our theoretical predictions. 
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1. Introduction 

The classic coupon collector problem asks for the expected number of runs to collect 
a complete set of n different coupons while during each run the collector randomly gets 
a coupon. The answer is nH n where H n = Y^k=i \ 1S ^ ne harmonic number [2 . One 
can further ask for the expected number of runs to collect m complete sets of coupons, 
which has been addressed by Newman and Shepp [5]. 

The coupon collector problem and its variants are of traditional and recurrent 
interest [3j [5] jJH 01 [7] . Besides of their rich theoretical structures and implications, there 
are various applications of them as well including dynamic resource allocation, hashing 
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and online load balancing Q], to just name a few. In particular, we want to point 
out that these problems also serve as basic models to analyze delay for opportunistic 
scheduling in broadcast wireless fading channels [S]. For example, to maximize system 
throughput, we should serve to the user whose channel condition is the best at every 
time slot. In order to evaluate performance, one may ask about the expected number 
of time slots needed for all users to be served at least once. Assuming all channels are 
i.i.d., then this is equivalent to the classic coupon collector problem. 

In this paper, we investigate a natural generalization of the coupon collector prob- 
lem. Instead of getting one coupon, the collector gets d (1 < d < n) distinct coupons 
randomly each run and he picks one that so far he has the least. Formally, we denote 
the number of runs (a "run" often referred to a unit of "time slot" or simply a unit of 
"time" in this paper) that are used to collect m sets of coupons as n and we are 
interested in characterizing the mean value of this random variable n especially 
in the asymptotic region when n is large. Clearly, when d = 1, we go back to the 
classic cases; when d = n, there is no randomness and n — mn. In the scheduling 
transmission context we just discussed, d can be viewed as a parameter to control the 
tradeoff between efficiency (high throughput) and fairness among all users with d = 1 
purely focusing on efficiency while d = n giving out perfect fairness. 

In the remaining part of this paper, we first briefly review existing related results in 
Section [5] Although they are all special cases of the general problem, the techniques 
used to derive them cannot be applied directly to the general case. Instead, we develop 
a new technique to characterize EiD^ n ) and provide upper and lower bounds for 
^(-^mn) m Section [3] and Section @] An asymptotic analysis shows that the upper 
bound and lower bound match in the asymptotic regime of n — > oo in Section [3] . 
Furthermore, for any finite n, an algorithm is motivated and proposed in Section [5] 
to calculate E(Df n n ) exactly. Finally, we use numerical examples to validate our 
theoretical predictions in section [7] 

2. Existing Results 

The existing results on special cases are listed below. If d = 1 , then the problem is 
solved for all to > 1. If d > 1, then only the to = 1 case is known. 
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• d = 1, m = 1 ([2])- It is clear that the number of runs needed to obtain (i + 
l)-th coupon after obtaining the i-th one follows a geometric distribution with 
parameter — -. Therefore, 



n 1 

E(Dl n ) = nH n = nJ2 1 : (1) 

For large n, 



k 

k=\ 



E{D\ n ) = nlogn + nO{\) (2) 
One sees that the randomness cost is expressed approximately by a factor logn. 

• d = l,m > 1 (0). 

E(Dlj = n (l-(l-S m (t)e-*) n )dt (3) 
Jo 

where S 1 ™^) = ^^o 1 H- 
For fixed m and large n, 

E ( D ln,n) = nlogn + n(m - 1) loglogn + nO(l). (4) 

It is interesting to note that although collecting the first set of coupons needs 
about nlogn runs, all later sets only need n log logn runs per set. 

• d > l,m = 1 ([S]) • This has applications in the scheduling of data packets 
transmission over wireless channels. Here 



n—i -. 

E(Df) = £ M , (5) 



where Q) = if i < d. 
For fixed m and large n, 



logn. (6) 



This shows that for the m = 1 case, allowing choosing <i coupons randomly each 
time decreases the expected number of runs and most of the decreases occurs 
from d = 1 to d = 2. 

• d > 1, m > 1. In the mentioned context of scheduling, if a transmitter wants to 
send m packets to each of the n users, but each time he can only transmits 
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one packet to one user chosen from the d users who have the best wireless 
communication channels. Due to the time varying nature of the wireless channels, 
it is natural to assume that for each time index, the d users who have the 
best communication channels are uniformly distributed among the n users. So 
gives an estimate on the total delay in delivering these m packets, and, 
in this paper, we will offer a characterization of E(D^ n ). 

3. Lower Bound on E{Df nn ) 

We will first lower bound E^D^ n ) by considering a different coupon collecting 
process. In this new process, each time we uniformly select d distinct coupons out of n 
coupons, and instead of keeping only one coupon out of these d selected coupons, we 
would keep all these d coupons. Apparently, the expected time of collecting m sets of 
coupons in this way will be no larger than the process which only keeps one coupon a 
time. 

However, it is not so straightforward to directly get a estimate for this new process. 
This motivates us to consider another process in which each time, one will collect d 
uniformly, independently chosen (allowing repeating) coupons and keep all of them. 
This process stops when m sets of coupons are fully collected. 

Lemma 1. Let t\ be the expected time to collected m sets of coupons for the process 
in which each time d uniformly chosen distinct coupons are kept. Let i 2 be the expected 
time to collected m sets of coupons for the process in which each time d uniformly 
chosen (allowing repetition) coupons are kept. Then 

*i ^ ( 7 ) 

Proof. We simulate the process of choosing d distinct coupons through an expur- 
gated process of choosing d independent coupons (allowing repetition) . If the d coupons 
we independently choose (allowing repeating) are not distinct, we will discard this 
group of d coupons; if they are all distinct d coupons, we will keep them. The kept 
coupons from expurgated process follow the same distribution as the chosen d distinct 
coupons. However, the expected time for one to get a group of d distinct coupons is 
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clearly j^-. So in the worst case, ti < fk\ti. □ 
(d) (d) 

In summary, in order to give a lower bound on E^D^ n ), we will first need a lower 
bound on ti for the process of keeping d uniformly randomly chosen coupons (allowing 
reptition). To do this, we follow the approach of generating functions in [5]. 

Let Pi be the probability of failure of obtaining m sets of coupons when we have 
kept i coupons. Let P Xl ... lX „ be a power series and let {P Xl ... >Xn } be the power series 
when all terms having all exponents > m have been removed. By these notations, 

oo 
3=0 

and 

with X\, • • • , x n all equal to 1. 
In addition, we know 

d—X oo 



Lt— X W n OO 

„-n ,-n JO 



(i)e-*) n )rf«, 



<Z=0 J=0 

where S m (i) = E™" 1 S 0- 

We also notice that Pi is nonincreasing as i grows, so 

oo oo 
3=0 3=0 



> n\ (l-(l-S m (t)e-T)dt/d. 

So by 0, we know 



E(D d m J > S (E(D^ n )/d) 



4. Upper Bound on E{Df nn ) 

In this section, we will upper bound the expected time of collecting m complete sets 
of coupons. To achieve this, we will upper bound the expected time for collecting m 
complete sets of coupons in a suboptimal process. In this new process, each time, we 
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will uniformly and independently choose d coupons (allowing repetition). Among this 
group of d coupons, we will start looking at them one by one. If the i-th (1 < i < d) 
coupon is the first such a coupon that we so far have fewer than m copies, then we will 
keep this i-th coupon and discard the remaining (d — i) coupons. 

First of all, we observe that d distinct coupons are favorable in terms of minimizing 
the collection time compared with d coupons with possible repeating. 

Theorem 1. The minimized expected time of collecting m sets of coupons, when each 
time the coupon collector is given d uniformly chosen distinct coupons but is only 
allowed to keep 1 coupon, is no bigger than the minimized expected time of collecting 
m set of coupons, when each time the coupon collector is given d uniformly chosen 
coupons (allowing repeating) but is only allowed to keep 1 coupon. 

Proof. Apparently, when the coupon collector is given d distinct coupons, he has 
more choices in making his decisions. □ 

Secondly, we show that it is an optimal strategy for the coupon collector to keep the 
coupon out of the d incoming coupons (whether allowing repetition or not), for which 
he has the fewest copies. 

Theorem 2. The expected time of collecting m sets of coupons is minimized when 
each time, the coupon collector keeps the coupon which he has the fewest so far, if the 
coupon collector is allowed to keep only 1 out of the d offered coupons. 

Proof. Suppose (before the coupon collector finishes collecting all m sets of coupons) 
among the uniformly chosen d (either distinct or allowing repeating) coupons, the j-th 
type of coupon (there are in total n types of coupons and 1 < j < n) is what he has 
the fewest, say c\ copies. Suppose further that he chooses instead to keep a different 
type of coupon, say the Z-th (1 < I < n, I ^ j) type of coupon, and, for this type of 
coupon, the coupon collector has already got c 2 copies, where c 2 > c\. Immediately, 
we know that after keeping the Z-th type of coupon, we have at least c 2 + 1 copies of 
Z-th type of coupons, which satisfies c 2 + 1 > c x + 2. We call the resulting state for the 
kept coupons as A, identified by the tuple (ci, c 2 + 1). Otherwise we would just keep 
the j-th type of coupon, then we will have c\ + 1 coupons of type j and c 2 coupons of 
type I. We call the resulting state in this case as B, identified by the tuple (ci + 1, c 2 ). 
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Now we argue that to collect m sets of coupons, on average starting from state B 
will take no longer time than starting from state A. The main idea is to let the collector 
starting from B follows the strategy of the collector starting from A, and do no worse 
in the expected delay. 

Let us consider two coupon collectors, one starting from state A and the other 
starting from state B. At each time index, these two collectors get the same d coupons. 
A coupon collector will keep one coupon only if he has fewer than m copies of coupons 
of that type. Suppose that the coupon collector starting from state A follows his 
optimized "keeping" decision such that his expected time to fully collect m sets of 
coupons is minimized. Then we let the coupon collector starting from state B follow 
the same decision process as the coupon collector starting from state A, until some 
time index when the coupon collector starting from state A decides to keep a coupon 
of type j or type At that time index, we also let the coupon collector starting from 
state B keep the same type of coupon as the coupon collector starting from state A 
does. Then let {c' 1; c' 2 } denote the resulting numbers of kept coupons of type j and type 
I, for the collector starting from state A; and likewise define {c'{,c 2 } for the collector 
starting from state B. 

Now we can take an inspection of state {c'^c^} and state {c'{,c 2 }. There are two 
scenarios to discuss separately. 

In the first scenario, c 2 = C\ + 1 and it is the coupon of type j that the coupon 
collector decides to keep at that time index. Then {c'^c^} = {ci + 1, ca + 1} and 
{c'{, c 2 } = {ci + 2, c 2 } = {c 2 + 1, Ci + 1}. Apparently {c[, c' 2 } and {c", c 2 '} are just the 
permutations of each other, and so by symmetry, the optimized time from these two 
new states to the completion of collecting m sets of coupons will be the same. 

In the second scenario, we have — c^+c^ > —c'l + c^ > 0. In this scenario, we update 
state A as {c'^c^} and update state B as {c'{,c 2 }\ and then construct an iterative 
process of evolving states A and B as follows. (To keep the notations consistent, at the 
beginning of a new iteration, we always represent state A and state B by {c' 1; c' 2 } and 
{c",^'}, even though they are different numbers than in previous iterations. We also 
remark that during these iterations, Ci,c' 2 ,Ci,c 2 always satisfy the listed constraints 
([5|)- (fT2")l . which will be obvious from the iterative process description.) 

At the beginning of each iteration, we have two coupon collectors starting from 



8 



Weiyu Xu, A. Kevin Tang 



states A and B respectively. At each time index, the coupon collector starting from 
state B will keep the same type of coupon as the coupon collector starting from state 
A, until some run they keep a coupon of either type j or type I. Again, we have two 
cases to consider. 

In the first case, c'l = c 2 and it is the coupon of type j that the coupon collectors 
decide to keep at that time index. After keeping that coupon, we have states {c^ + 1, c' 2 } 
and {c'l + l,c 2 } for the collectors starting from A and starting from B, respectively. 
So if c'{ + 1 = c' 2 , then by symmetry, {c'l + 1, c 2 } and {c[ + l,c' 2 } are two equivalent 
states, and we are done (note that c[ + c' 2 = c'l + c 2 ) ; otherwise, if c" + 1 < c 2 , we 
update state A and B as {c[ + 1, c' 2 } and {c 2 , c'l + 1} respectively. 

In the second case, after keeping that new coupon of type j or type I, we simply 
update state A and state B respectively to record the new numbers of type j and type I 
coupons for these two coupon collectors. Then we go back to the beginning of another 
iteration. 

Note in all these iterations, we always maintain 

c' l+ c' 2 = c'l + 4, (8) 
c'i < 4 (9) 

c'l < 4', (io) 

c' 2 -c\ > 4 -c'l, (11) 
Ci, c 2 ,c 1 ,c 2 < m. (12) 

Because of this, in each iteration, when the coupon collector starting from A can keep 
a certain type of coupon, the coupon collector starting from state B can also keep the 
same type of coupon. 

Because for every iteration, we will increase c[+c' 2 and c"+c 2 , and c[, c' 2 , c'l, c 2 < m, 
if we iterate the previous processes, we will eventually run into a pair of symmetric 
states in some iteration for these two coupon collectors. 

Since we can always end up in a symmetric state for the two coupon collectors, 
starting from B to collect m sets of coupons will not take longer time than starting 
from A. 

□ 
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In fact, the previous arguments can essentially show that starting from a state 
{c[, c' 2 } has no bigger expected collection time than from the state {c'{, c' 2 ' }, if \c[ — c' 2 \ < 
\c'{ - c' 2 '\ and c[ + c' 2 = c'{ + c' 2 '. 

At this point, we are ready to present the following upper bound for E(Df n n ). 

Theorem 3. Suppose that the coupon collector is given d uniformly randomly chosen 
d distinct coupons and he is only allowed to keep one out of these d distinct coupons. 
Then the expected time EiT)^ n ) < -p^ — h mn(l — 1/d). 

Proof. From Theorem [2] and [TJ we will consider an upper bound on the expected 
finishing time if each run the coupon collector is given d independently chosen coupons 
(allowing repeating) and he decides to keep only the first "useful" coupon when it is 
available among the d coupons. Here a kept "useful" coupon means this coupon is kept 
before the coupon collector has m copies of that type of coupon. 

The idea of the proof is to upper bound the expected finishing time conditioning on 
a a specific sequence of kept coupons, until we have m sets of coupons. By a specific 
sequence of "keeper" coupons, we mean a sequence of kept "useful" coupons, specified in 
their types and the order of keeping them. We note that a specific sequence of "keeper" 
coupons contain mn coupons. 

First we make a key observation about the probability that the coupon collector 
follows a specific sequence of "keeper" coupons. This probability will be the same as 
the corresponding probability that he follows this specific sequence of kept coupons, in 
another collection process where each run he is only offered 1 instead of d independently, 
uniformly chosen coupons. This is because, when the coupon collector is offered d 
coupons, he still checks them one by one and only keeps the first one that is "useful". 

Now, conditioning on a specific sequence of kept coupons, we are interested in the 
expected time to collect that sequence. Suppose that right after the r-th coupon in 
this "keeper" sequence has just been kept, there are s types of coupons for which the 
coupon collector have m copies. We then want to know what is the expected number 
of runs needed to collect the next (r+ l)-th "keeper" coupon, conditioning on the whole 
"keeper" coupon sequence is known. 

Through the conditional event that the "keeper" sequence has already specified, it is 
a standard exercise to show that the average time the coupon collector takes to collect 
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the (r + l)-th "keeper" coupon will be 

E' 



i - (i - ^ ) d 

In fact, conditioned on the fact that the next "keeper" coupon is already specified, 
unless the next inspected coupon belongs to the s types of coupons for which the 
collector has already had m copies, it must be the (r + l)-th "keeper" coupon. So 
conditioned on the already specified next "keeper" (the same as conditioned on the 
whole "keeper" sequence is known, because the collecting process is a Markov chain), a 
uniformly chosen coupon is the specified "keeper" with probability . So conditioned 
on the (r + l)-th "keeper" is known, with probability (1 — s=s -) , none of the d uniformly 
chosen (allowing repeating) coupons is the known (r + l)-th "keeper". 

However, we note that if the coupon collector is only offered 1 instead of d coupons 
each time, then the expected time to get the (r + l)-th coupon in that sequence on 
average will be 

E" = -2-. 

n — s 

By Lemma [5] proven latter than, we know that 

E' - E"/d < 1 - ~ 

for any 1 < s < n. 

Since there are exactly mn coupons in any specific sequence of "keeper" coupons, 
the total expected time E' s for collecting a whole "keeper" sequence S, when each time 
the coupon collector has d coupons (allowing repeating) to choose from, and the total 
expected time Eg for collecting the same "keeper" sequence S, when each time the 
coupon collector only has 1 incoming random coupon, satisfy 

E s -E'^/d<mn(l--). (13) 

By invoking the fact that the coupon collector follows any specific sequence of 
"keeper" coupons with the same probability for both the d = 1 case and the d ^ 1, 
(TT3")) implies 

E(D d m ,J < + mn(l - l/d), (14) 

for any d, which is exactly the theorem statement. □ 
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Lemma 2. Function f(i) = — ±- )d is decreasing for 1 < i < n; and, i — 1 < 

f(i) < for 1 < i < n. 

Proof. We need to show that the derivative 



f'(i) 



»(i - a - i) d y rf -* 2 

for i 6 [1, n] . 
Let 

#(x) = (1 - x) d + d • x(l - x)^ . 

So 

g'{x) = ~d{l - x)^ (x + d • x + 2(1 - x)^ - 2] 

Also let 

h(x) = x + d ■ x + 2(1 - x)^ - 2. 

So 

h'(x) = 1 + d - (1 + d)(l - x)^ > 0, 
and /i(x) > h(0) = for x > 0. Because 

1 d-3 

s'(x) = -^^(i - x) — h{x) < for < x < 1, 

we have 

< g(0) = 1 for < x < 1. 

This translates into 

1 - (1 - x) d > d • x(l - x)^ f or o < x < 1, 



so 



' < tt^ rv~r for < x < 1. 



Namely 



(1 - (1 -x) d ) 2 ~ d 2 -x 2 (l -x)^ 1 
d(l-x) d_1 1 



Plugging in x = ^, we have 

d(l - 



, ,xTT < -: — o for 1 < « < n. 
n (l _ (1 _ i)d)2 " d-*2 " " 
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So 

f'(i) < for 1 < i < n. 

Calculating f(n), we have 

n 1 1 

> — — 1 lor 1 < i < n. 



d-i 1 - (1 - i) d ~ d 

□ 

5. An Asymptotic Analysis (n — > oo) 

In this section, we will give an asymptotic analysis of the upper and lower bounds 
for E{Df n n } and see how it behaves asymptotically for fixed d and m as n goes to oo. 
We will begin with an asymptotic analysis through an exact expression for E{Df n }. 

Theorem 4. When n is large enough and d > 1, 

EP?,„] =»(^+0(l)) (15) 



Proof. 



> 



E[DU 

n— 1 n— 1 j 

i=0 1 - t=0 1 n(n-l)...(n-d+l) 

71— 1 ^ 

rf +E — — ^ 



ri—a ^ 

= d + E — ; r^r- 

4=1 i-(i-^t) 

Since (1 - x) d > 1 - da; for 1 < x < 1, 

B(I> lin ) > ^ — - 2. 



d-i nd 4^ n — i 



as n — >■ oo. 
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E(D 1>n ) ^2 1 i(i-i)...(i-d+i) 



2=0 1 n(ra-l)...(ra-d+l) 
n-1 



»=i 

= i£7(i?J jn )+„(i-i). 



□ 



Theorem 5. When m is fixed, then for any d > 1, 

y E(D d m J-nlog(n)/d 

hm -p— ■ — — r— — r— = 1 (16) 

n-toc [n(rn — 1) log log n)/d 

Proof. From the lower bound and upper bound for E(D^ n ) in Section [3] and Section 
[4J we know 

E(Dfn.n) 1 



lim 



E(DIJ d' 



Then the asymptotic expression emerges immediately by recalling the asymptotic 
expression for B(D^ n ). □ 

6. An Algorithmic Approach (for any finite n) 

In this section, we will give an algorithm which calculates exactly E(D^ n ) for 
specified m, n and d based on a state-space representation of the Markov process of 
collecting the coupons. For each no, fix, ri2, ■ ■ ■ , n m > satisfying hq+ui + - ■ ■+n m = ri, 
define S m — (n ,ni, ■ ■ ■ ,n m ) to be the state where (0 < i < m) is the number 
of coupons that the coupon collector has collected i times. Hence, E(D^ nn ) is the 
expected number of runs for the coupon collector to go from state (n, 0, . . . , 0) to state 
(0,...,0,n). 



14 



Weiyu Xu, A. Kevin Tang 



We now provide an algorithm to calculate E{D'^ n n ). Define 
<{S m ) 

= starting from state S m , the number of runs after 
which to completed sets of coupons have been 
collected, i.e., the number of runs from state S m 
to (0, • • • ,0,n) 

Clearly, 

JV£(n,0,---,0) = lft in) (17) 
JV£(0,---,0,n)=0, (18) 

Suppose we are at state S m — (no,ni, • ■ ■ ,n m ). After one run, the transition 
probability from S m to the following two states are as follows (w.p. is abbreviation for 
"with probability"): 

(n ,ni,-- - ,n m ) w.p. - - 



(n , • • • , rii - 1, n l+ i + !,••• , n m ) 



Q) 

w.p. Pi 

< i < to 



where ft =((E=j.-«)-(^i-*))/( 3 ). 
Therefore, we have the equation 



E[Ni{n Q ,--- ,n m )} 

l+M/f^ xE[Ni(n ,..-,n m )] 



+ ^2 p l E[N% l (n , ■■■ ,ru- l,n,+i + !,••• ,n m )]. 



j=0 



So 



£7[JV^(no," - ,n m )] 
(3) - (7) 

5 C TO - ) 

x£ , [^(n , • • • , rii - l,n i+ i + !,-••, n m )]. 
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Define map $ : {(no, • • • , n m ) : rig, n%, ■ ■ ■ , n m > 0, rig + m + ■ ■ ■ + n m = n} — > N, 
where 

$(n ,ni,n 2 , • • ■ ,n m ) = ^(1 + n) m_l rii. 

i=0 

Obviously, $ is an injection and 

$(n, 0,-- - ,0) 
$(0,... ,0,n) 

Since 

$(no, • • • ,n m ) 

- $(n , • • • , n< - 1, + 1, • • ■ , n m ) 

= ((l + n)™-V: + (l + n) m - J - 1 n l+1 ) 

-((1 + n) m -*(ni - 1) + (1 + n)" 1 "*- 1 ^! + 1)) 
= (1 + n) m_i - (1 + n)™^" 1 
> 0, 

by (|19p , the expected number of runs from a state S only depends on the expected num- 
ber of runs from states S*'s with $(5*) < $(5). Therefore, we can order all the states 
(no, • • • , n m ) according to the value of &(riQ, • • • , n m ) ail d compute E[Nm(no, • • • , n m )} 
one by one, from the starting state (0, ■ ■ • ,0, n) to the last state (n, 0, • • ■ ,0). The 
algorithm is described in Algorithm [TJ 

Since the number of non-negative integer solutions to the equation no + ■ ■ ■ + n m = 
n is ( n+ J n ) , the number of states is ("| t m ) , and the complexity of Algorithm [T] is 

To conclude this section, we now use a simple example (n = 6, m = 2) to illustrate 
Algorithm [TJ When m = 2, each state has 3 parameters n , rii, and n 2 . Since 
uq + tii +n 2 = n, we could draw the state transition diagram as in Figure[JJ Algorithm 
[JJ computes E[Ni[ {no, n%, TI2)] for each state (no,ni,i%2) by the order shown in Figure 
[2] This is right because 

• The expected number of runs from any state only depends on the number of runs 
from its descents in Figure [TJ 



= n ■ (1 + n) 
= n. 
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Algorithm 1 Calculating E^D^ n ) 

for no = to n 
for Til = to n — no 

for n m _i = to n - YT=q 2 n i 
do n m = n - 2^ i=0 «i 
if n m = n 

then E[Ni(n , ■ ■ ■ ,n m )\ =0 
else use equation (|T9|) to compute 
S[A^(no,-- - ,"m)] 

• The computation of ElN* (• • • )] for any state is done after the computations for 
its descents by Figure [2j 

The values of E[N2 (no, ni, 712)] for each state (no, rii, n.2) is shown in Figure[3J 

7. Numerical Examples 

We now engage in numerical exercises to support results in the last two sections, 
i.e. correctness of Algorithm [1] and the derived upper and lower bounds on EiD 1 ^ n ). 

7.1. Algorithm 

First, we give numerical results for the expected collection time when n = 100 and 
rn = 1,2,3 respectively, in three tables. The results show that Algorithm Q] gives an 
expected delay consistent with the simulation results. 



Table 1: m = l,n = 100 



d 


1 


2 


3 


4 


5 


Algorithm 


518.74 


292.93 


220.06 


184.79 


164.27 


Simulation 


518.69 


292.40 


219.33 


184.59 


164.18 
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0,0,6 



1,0,5— >0,1, 5 



2,0,4— >1, 1,4— >0,2,4 



3,0,3^-2,1 ,3— H,2,3— >0,3,3 



4,0,2—3,1,2—2,2,2 



1,3,2 



0,4,2 



5,0,1 



4,1,1 



3,2,1—2,3,1—1,4,1—0,5,1 



6,0,0 — 5,1,0—4,2,0 — 3,3,0 — 2,4,0 — 1,5,0 — 0,6,0 



Figure 1: State transition diagram for n — 6,m = 2. Self-loops are omitted. The nodes are 
labelled with the value of no, Hi, n<i- 



Table 2: m = 2,n = 100 



d 


1 


2 


3 


4 


5 


Algorithm 


728.81 


418.69 


327.02 


286.75 


264.84 


Simulation 


728.20 


419.13 


327.29 


286.81 


264.68 



7.2. Asymptotic Results 

Two cases are considered: (d = 3, to = 1) and (d = 3, to = 2). For each case, three 
lines are plotted. First, the lower bound from Theorem[5]is plotted. Second, the upper 
bound is from Theorem Q] is computed. Finally, both plots are compared against the 
result computed from the algorithm for n from 100 to 500. The results show that the 
upper bound and lower bound bound the expected collecting time very well. In fact, 
when to and d are fixed, the upper bound and lower bound will both scale as ^E^D^ n ) 
asn-> 00. 



Table 3: m = 3,n = 100 



d 


1 


2 


3 


4 


5 


Algorithm 


910.87 


531.34 


428.75 


386.97 


364.86 


Simulation 


910.09 


531.33 


428.72 


386.65 


364.90 
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8 
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25 




23 




20 










27 




26 




24 





3 












10 




4 








16 




11 








21 




17 


n — 


6, m = 



12 



the computation order. The highest node, which represents (0, 0, 6) is labelled because 
Ni(0, 0, 6) is known to be 0. 

8. Conclusion and Future Work 



In this paper, we considered a generalized coupon collector problem where the 
coupon collector needs to collect m > 1 sets of coupons and has the freedom of keeping 
one coupon out of d > 1 coupons offered each time. We obtained asymptotically 
matching upper and lower bounds for the expected collection time. We also provided 
an algorithm to calculate the expected collection time exactly based on a state repre- 
sentation for the coupon collecting process. We should note that asymptotically even if 
the coupon collector is only allowed to keep 1 coupon out of the d coupons, the needed 
time will still be shortened by a factor of d, as if the coupon collector is allowed to 
keep all the d coupons offered each time. 

There is much avenue for future work on this problem. First, one could attempt 
to get a closed-form expression for E(D^ n ). Second, one could attempt to improve 
Algorithm 1. Algorithm 1 has a runtime of (™~^ m )- To take advantage of this runtime 
requires constant time indexing. The direct approach is to index the states in an n- 
dimensional matrix of size (n + l) m + 1. However, since there are a total of (™^ m ) 
states, a large fraction of the matrix space is not required. Hence, it would be helpful 
to find an algorithm which carries out triangular indexing in constant time. This would 
reduce the memory requirements and increase the range of parameters over which the 
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0.00 



6.00 




3.00 












8.59 




6.93 




4.67 










10.46 




9.21 




7.76 


5.92 



12.09 




11.02 




9.86 




8.55 




6.99 
















13.68 




12.68 




11.65 




10.56 




9.36 




7.99 













15.44 




14.44 




13.44 




12.43 




11.37 




10.24 




8.99 















Figure 3: State transition diagram for n = 6, m = 2. The nodes are labelled by 
E[Nl(no,ni,n 2 )]. 




Figure 4: Asymptotics for m = 1, d = 3. 




j- ^ ^ ± — 

m=2,d=3 



Figure 5: Asymptotics for m — 2, d = 3. 
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problem is computationally feasible. One could further observe that although there 
are (™^ m ) states, only ("^"j 1 ) are actually needed at any time. So with constant 
time triangular indexing, one can reduce the memory requirements further although 
the gain from the second reduction is minimal. 
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